Generalizing Local Translation Models
نویسنده
چکیده
We investigate translation modeling based on exponential estimates which generalize essential components of standard translation models. In application to a hierarchical phrasebased system the simplest generalization allows its models of lexical selection and reordering to be conditioned on arbitrary attributes of the source sentence and its annotation. Viewing these estimates as approximations of sentence-level probabilities motivates further elaborations that seek to exploit general syntactic and morphological patterns. Dimensionality control with `1 regularizers makes it possible to negotiate the tradeoff between translation quality and decoding speed. Putting together and extending several recent advances in phrase-based translation we arrive at a flexible modeling framework that allows efficient leveraging of monolingual resources and tools. Experiments with features derived from the output of Chinese and Arabic parsers and an Arabic lemmatizer show significant improvements over a strong baseline.
منابع مشابه
Generalizing Local and Non-Local Word-Reordering Patterns for Syntax-Based Machine Translation
Syntactic word reordering is essential for translations across different grammar structures between syntactically distant languagepairs. In this paper, we propose to embed local and non-local word reordering decisions in a synchronous context free grammar, and leverages the grammar in a chartbased decoder. Local word-reordering is effectively encoded in Hiero-like rules; whereas non-local word-...
متن کاملGeneralizing Word Lattice Translation
Word lattice decoding has proven useful in spoken language translation; we argue that it provides a compelling model for translation of text genres, as well. We show that prior work in translating lattices using finite state techniques can be naturally extended to more expressive synchronous context-free grammarbased models. Additionally, we resolve a significant complication that non-linear wo...
متن کاملStatistical Machine Translation by Generalized Parsing
Designers of statistical machine translation (SMT) systems have begun to employ tree-structured translation models. Systems involving tree-structured translation models tend to be complex. This article aims to reduce the conceptual complexity of such systems, in order to make them easier to design, implement, debug, use, study, understand, explain, modify, and improve. In service of this goal, ...
متن کاملGeneralized Parsers for Machine Translation
Designers of statistical machine translation (SMT) systems have begun to employ treestructured translation models. Systems involving tree-structured translation models tend to be complex. This article aims to reduce the conceptual complexity of such systems, in order to make them easier to design, implement, debug, use, study, understand, explain, modify, and improve. In service of this goal, t...
متن کاملTOP LOCAL COHOMOLOGY AND TOP FORMAL LOCAL COHOMOLOGY MODULES WITH SPECIFIED ATTACHED PRIMES
Let (R,m) be a Noetherian local ring, M be a finitely generated R-module of dimension n and a be an ideal of R. In this paper, generalizing the main results of Dibaei and Jafari [3] and Rezaei [8], we will show that if T is a subset of AsshR M, then there exists an ideal a of R such that AttR Hna (M)=T. As an application, we give some relationships between top local cohomology modules and top f...
متن کامل